44 research outputs found

    Lyndon Array Construction during Burrows-Wheeler Inversion

    Get PDF
    In this paper we present an algorithm to compute the Lyndon array of a string TT of length nn as a byproduct of the inversion of the Burrows-Wheeler transform of TT. Our algorithm runs in linear time using only a stack in addition to the data structures used for Burrows-Wheeler inversion. We compare our algorithm with two other linear-time algorithms for Lyndon array construction and show that computing the Burrows-Wheeler transform and then constructing the Lyndon array is competitive compared to the known approaches. We also propose a new balanced parenthesis representation for the Lyndon array that uses 2n+o(n)2n+o(n) bits of space and supports constant time access. This representation can be built in linear time using O(n)O(n) words of space, or in O(nlogn/loglogn)O(n\log n/\log\log n) time using asymptotically the same space as TT

    Metagenomic analysis through the extended Burrows-Wheeler transform

    Get PDF
    Background: The development of Next Generation Sequencing (NGS) has had a major impact on the study of genetic sequences. Among problems that researchers in the field have to face, one of the most challenging is the taxonomic classification of metagenomic reads, i.e., identifying the microorganisms that are present in a sample collected directly from the environment. The analysis of environmental samples (metagenomes) are particularly important to figure out the microbial composition of different ecosystems and it is used in a wide variety of fields: for instance, metagenomic studies in agriculture can help understanding the interactions between plants and microbes, or in ecology, they can provide valuable insights into the functions of environmental communities. Results: In this paper, we describe a new lightweight alignment-free and assembly-free framework for metagenomic classification that compares each unknown sequence in the sample to a collection of known genomes. We take advantage of the combinatorial properties of an extension of the Burrows-Wheeler transform, and we sequentially scan the required data structures, so that we can analyze unknown sequences of large collections using little internal memory. The tool LiME (Lightweight Metagenomics via eBWT) is available at https://github.com/veronicaguerrini/LiME. Conclusions: In order to assess the reliability of our approach, we run several experiments on NGS data from two simulated metagenomes among those provided in benchmarking analysis and on a real metagenome from the Human Microbiome Project. The experiment results on the simulated data show that LiME is competitive with the widely used taxonomic classifiers. It achieves high levels of precision and specificity - e.g. 99.9% of the positive control reads are correctly assigned and the percentage of classified reads of the negative control is less than 0.01% - while keeping a high sensitivity. On the real metagenome, we show that LiME is able to deliver classification results comparable to that of MagicBlast. Overall, the experiments confirm the effectiveness of our method and its high accuracy even in negative control samples

    Inducing the Lyndon Array

    Get PDF
    In this paper we propose a variant of the induced suffix sorting algorithm by Nong (TOIS, 2013) that computes simultaneously the Lyndon array and the suffix array of a text in O(n) time using O(n) words of working space, where n is the length of the text and is the alphabet size. Our result improves the previous best space requirement for linear time computation of the Lyndon array. In fact, all the known linear algorithms for Lyndon array computation use suffix sorting as a preprocessing step and use O(n) words of working space in addition to the Lyndon array and suffix array. Experimental results with real and synthetic datasets show that our algorithm is not only space-efficient but also fast in practice

    Gsufsort: Constructing suffix arrays, LCP arrays and BWTs for string collections

    Get PDF
    Background: The construction of a suffix array for a collection of strings is a fundamental task in Bioinformatics and in many other applications that process strings. Related data structures, as the Longest Common Prefix array, the Burrows-Wheeler transform, and the document array, are often needed to accompany the suffix array to efficiently solve a wide variety of problems. While several algorithms have been proposed to construct the suffix array for a single string, less emphasis has been put on algorithms to construct suffix arrays for string collections. Result: In this paper we introduce gsufsort, an open source software for constructing the suffix array and related data indexing structures for a string collection with N symbols in O(N) time. Our tool is written in ANSI/C and is based on the algorithm gSACA-K (Louza et al. in Theor Comput Sci 678:22-39, 2017), the fastest algorithm to construct suffix arrays for string collections. The tool supports large fasta, fastq and text files with multiple strings as input. Experiments have shown very good performance on different types of strings. Conclusions: gsufsort is a fast, portable, and lightweight tool for constructing the suffix array and additional data structures for string collections

    Eficiência da seleção de plantas de arroz geneticamente modificado pelo herbicida glufosinato de amônia por meio da análise de PCR.

    Get PDF
    Neste estudo foi realizada a aplicação da solução aquosa do herbicida glufosinato de amônia 2% (m/v: 20 g/L) (produto comercial Liberty), a fim de selecionar plantas da cultivar BRSMG Curinga geneticamente modificadas para o gene Rubisco

    Consortium neuroscience of attention deficit/hyperactivity disorder and autism spectrum disorder: The ENIGMA adventure

    Get PDF
    Neuroimaging has been extensively used to study brain structure and function in individuals with attention deficit/hyperactivity disorder (ADHD) and autism spectrum disorder (ASD) over the past decades. Two of the main shortcomings of the neuroimaging literature of these disorders are the small sample sizes employed and the heterogeneity of methods used. In 2013 and 2014, the ENIGMA-ADHD and ENIGMA-ASD working groups were respectively, founded with a common goal to address these limitations. Here, we provide a narrative review of the thus far completed and still ongoing projects of these working groups. Due to an implicitly hierarchical psychiatric diagnostic classification system, the fields of ADHD and ASD have developed largely in isolation, despite the considerable overlap in the occurrence of the disorders. The collaboration between the ENIGMA-ADHD and -ASD working groups seeks to bring the neuroimaging efforts of the two disorders closer together. The outcomes of case–control studies of subcortical and cortical structures showed that subcortical volumes are similarly affected in ASD and ADHD, albeit with small effect sizes. Cortical analyses identified unique differences in each disorder, but also considerable overlap between the two, specifically in cortical thickness. Ongoing work is examining alternative research questions, such as brain laterality, prediction of case–control status, and anatomical heterogeneity. In brief, great strides have been made toward fulfilling the aims of the ENIGMA collaborations, while new ideas and follow-up analyses continue that include more imaging modalities (diffusion MRI and resting-state functional MRI), collaborations with other large databases, and samples with dual diagnoses

    Consortium neuroscience of attention deficit/hyperactivity disorder and autism spectrum disorder:The ENIGMA adventure

    Get PDF
    International audienc

    Brain imaging of the cortex in ADHD: a coordinated analysis of large-scale clinical and population-based samples

    Get PDF
    Objective: Neuroimaging studies show structural alterations of various brain regions in children and adults with attention deficit hyperactivity disorder (ADHD), although nonreplications are frequent. The authors sought to identify cortical characteristics related to ADHD using large-scale studies. Methods: Cortical thickness and surface area (based on the Desikan–Killiany atlas) were compared between case subjects with ADHD (N=2,246) and control subjects (N=1,934) for children, adolescents, and adults separately in ENIGMA-ADHD, a consortium of 36 centers. To assess familial effects on cortical measures, case subjects, unaffected siblings, and control subjects in the NeuroIMAGE study (N=506) were compared. Associations of the attention scale from the Child Behavior Checklist with cortical measures were determined in a pediatric population sample (Generation-R, N=2,707). Results: In the ENIGMA-ADHD sample, lower surface area values were found in children with ADHD, mainly in frontal, cingulate, and temporal regions; the largest significant effect was for total surface area (Cohen’s d=−0.21). Fusiform gyrus and temporal pole cortical thickness was also lower in children with ADHD. Neither surface area nor thickness differences were found in the adolescent or adult groups. Familial effects were seen for surface area in several regions. In an overlapping set of regions, surface area, but not thickness, was associated with attention problems in the Generation-R sample. Conclusions: Subtle differences in cortical surface area are widespread in children but not adolescents and adults with ADHD, confirming involvement of the frontal cortex and highlighting regions deserving further attention. Notably, the alterations behave like endophenotypes in families and are linked to ADHD symptoms in the population, extending evidence that ADHD behaves as a continuous trait in the population. Future longitudinal studies should clarify individual lifespan trajectories that lead to nonsignificant findings in adolescent and adult groups despite the presence of an ADHD diagnosis

    Evidence for similar structural brain anomalies in youth and adult attention-deficit/hyperactivity disorder: a machine learning analysis

    Get PDF
    Attention-deficit/hyperactivity disorder (ADHD) affects 5% of children world-wide. Of these, two-thirds continue to have impairing symptoms of ADHD into adulthood. Although a large literature implicates structural brain differences of the disorder, it is not clear if adults with ADHD have similar neuroanatomical differences as those seen in children with recent reports from the large ENIGMA-ADHD consortium finding structural differences for children but not for adults. This paper uses deep learning neural network classification models to determine if there are neuroanatomical changes in the brains of children with ADHD that are also observed for adult ADHD, and vice versa. We found that structural MRI data can significantly separate ADHD from control participants for both children and adults. Consistent with the prior reports from ENIGMA-ADHD, prediction performance and effect sizes were better for the child than the adult samples. The model trained on adult samples significantly predicted ADHD in the child sample, suggesting that our model learned anatomical features that are common to ADHD in childhood and adulthood. These results support the continuity of ADHD’s brain differences from childhood to adulthood. In addition, our work demonstrates a novel use of neural network classification models to test hypotheses about developmental continuity

    Analysis of structural brain asymmetries in attention-deficit/hyperactivity disorder in 39 datasets

    Get PDF
    Objective Some studies have suggested alterations of structural brain asymmetry in attention-deficit/hyperactivity disorder (ADHD), but findings have been contradictory and based on small samples. Here, we performed the largest ever analysis of brain left-right asymmetry in ADHD, using 39 datasets of the ENIGMA consortium. Methods We analyzed asymmetry of subcortical and cerebral cortical structures in up to 1,933 people with ADHD and 1,829 unaffected controls. Asymmetry Indexes (AIs) were calculated per participant for each bilaterally paired measure, and linear mixed effects modeling was applied separately in children, adolescents, adults, and the total sample, to test exhaustively for potential associations of ADHD with structural brain asymmetries. Results There was no evidence for altered caudate nucleus asymmetry in ADHD, in contrast to prior literature. In children, there was less rightward asymmetry of the total hemispheric surface area compared to controls (t = 2.1, p = .04). Lower rightward asymmetry of medial orbitofrontal cortex surface area in ADHD (t = 2.7, p = .01) was similar to a recent finding for autism spectrum disorder. There were also some differences in cortical thickness asymmetry across age groups. In adults with ADHD, globus pallidus asymmetry was altered compared to those without ADHD. However, all effects were small (Cohen’s d from −0.18 to 0.18) and would not survive study-wide correction for multiple testing. Conclusion Prior studies of altered structural brain asymmetry in ADHD were likely underpowered to detect the small effects reported here. Altered structural asymmetry is unlikely to provide a useful biomarker for ADHD, but may provide neurobiological insights into the trait
    corecore